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The increasing genetic diversity of HIV-1 in 

the UK, 2002-2010 

The UK Collaborative Group on HIV Drug Resistance 

Objective: HIV-1 is typically categorized by genetically distinct viral subtypes. Viral 
subtypes are usually compartmentalized by ethnicity and transmission group and, thus, 
convey important epidemiological information, as well as possibly influencing the rate 
of disease progression. We aim to describe the prevalence and time trends of subtypes 
observed among key populations living with HIV-1 in the UK. 

Design: Analyses of reverse transcriptase and protease sequences generated from 
HIV-1 -positive antiretroviral-naive patients as part of routine resistance testing between 
2002 and 2010 in all public health and NHS laboratories in the UK. 

Methods: Subtype was assigned centrally using the SCUEAL algorithm. Subtyping 
results were combined with data from the UK Collaborative HIV Cohort Study and 
the UK HIV and AIDS Reporting System. Analyses adjusted for the number of national 
HIV-1 diagnoses made each year within demographic subgroups. Viral subtypes were 
described overall, over time and by demographic subgroup. 

Results: Subtype B diagnoses (39.9%) have remained stable since 2005, whereas 
subtype C diagnoses (34.3%) were found to decline in prevalence from 2004. Across 
most demographic subgroups, the prevalence of non-B non-C subtypes has increased 
overtime, in particular novel recombinant forms (9.9%), subtype G (2.7%), and CRF01 
AE (2.0%). 

Conclusion: HIV-1 subtypes are increasingly represented across all demographic 
subgroups and this could be evidence of sexual mixing. Between 2002 and 2010, 
the prevalence of novel recombinant forms has increased in all demographic sub- 
groups. This increasing genetic diversity and the effect of subtype on disease progression 
may impact future HIV-1 treatment and prevention. 
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Introduction 



Categorizing HIV-1 into genetic subtypes has been a 
powerful epidemiological tool to provide insights into 
patterns of HIV-1 transmission, with direct implications 
for public health policy. Genetically distinct viral strains 
are typically classified as one of nine distinct viral subtypes 
and mosaic viruses include 55 circulating recombinant 
forms (CRFs), unique recombinant forms (URFs) and 
other less well characterized complex structures [1,2]. 
Alongside epidemiological uses, genetically divergent 
strains have biological differences [3] , which may impact 



on the development of drug resistance [4], the 
susceptibility and response to antiretroviral therapy 
(ART) [5] and the rate of disease progression [5—7], 
thus potentially bearing on clinical care. 

The presence of non-B subtypes has been increasing in 
western Europe in recent years as a result of immigration 
from sub-Saharan Africa, Asia and eastern Europe [8]. 
This has led to multiple parallel epidemics within the 
UK, and subtypes are strongly associated with ethnicity 
and exposure group. In the UK, most HIV-1 group M 
subtypes and several CRFs are represented [9] and 
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subtype B virus, typically associated with MSM, is the 
most prevalent. The subtypes observed in the UK have 
been recently described for MSM [10] and, historically, 
for heterosexuals [11]. We aim to describe the prevalence 
and time trends of subtypes observed among major 
demographic subgroups affected by HIV-1 in the 
UK. 



Methods 

Resistance and clinical data 

The UK HIV Drug Resistance Database was established 
in 2001 and attempts to collect all resistance tests 
conducted within public health and NHS laboratories in 
the UK as part of routine clinical care. The resistance tests 
analysed in this study used bulk sequencing of the pol 
gene, encoding the protease and reverse transcriptase 
genes using a variety of in-house and commercial assays. 
Resistance tests conducted between 2002 and 2010 were 
used; from 2003 UK guidelines recommended that 
resistance testing be performed for all drug-naive patients 
prior to commencing treatment, and from 2005 [12J for 
all newly diagnosed patients. 

Subtype was assigned centrally based on the Subtype 
Classification Using Evolutionary Algorithms (SCUEAL) 
[13], which uses model-based phylogenetic methods to 
assign viral subtype. Previous studies have used the REGA 
algorithm [14]; however, SCUEAL has been shown to 
leave a smaller number of sequences as unclassified, which 
are of growing importance [13]. Viruses with mosaic 
genomes which do not yet fulfil the criterion to be 
established as new CRFs were classified as novel 
recombinants. Sub-subtypes such as Al and A2 or Fl 
and F2 were combined in our analysis, as were CRFs 
other than CRF01_AE, so that categories with small 
numbers could be avoided allowing changes over time in 
these groups to be analysed. 

Demographic data were acquired via annual linkage 
(using pseudo-anonymized identifiers such as clinic 
number, soundex and date of birth) to the UK 
Collaborative HIV Cohort Study (UK CHIC), which 
includes patients from 13 of the largest clinics within the 
UK [15], and to the HIV and AIDS Reporting System 
(HARS) [16], coordinated by Public Health England. 
These data include all new diagnoses reported nationally 
since the beginning of the epidemic and data from a 
national cohort of persons in care updated annually 
since 1997. 

Analysis 

The first ART-naive resistance sample for each patient in 
the UK HIV Drug Resistance Database conducted 
between 2002 and 20 1 0 was analysed, in which year refers 
to the date of sample. Our analyses examined the 



following demographic subgroups: black African women, 
black African heterosexual men, white heterosexual men, 
white heterosexual women, black African MSM, white 
MSM and other MSM. Within each year and demo- 
graphic subgroup, the proportion of resistance tests of 
each subtype was calculated. To adjust for the overall 
number of new diagnoses within the UK, these figures 
were then multiplied by the total number of new 
diagnoses [17] per year within each demographic 
subgroup to give the total number of diagnoses per year 
for each subtype. These temporal estimates were 
combined for an analysis of subtype by demographic 
subgroup and were combined across demographic 
subgroups for an analysis of overall temporal trends. 
More detailed analyses examined temporal trends by 
demographic subgroups and categorized a subtype as 
either subtype B, C or non-B, non-C to avoid categories 
with small numbers. Differences in subtype prevalence 
between groups were tested using the / test. All analyses 
were conducted in Stata/IC 12.1 (StataCorp LP, College 
Station, Texas, USA). 



Results 

Study population 

In total, 27 657 ART-naive resistance tests from 2002 to 
2010 were available for analysis; this represents 46% of the 
60 296 total diagnoses during this period. A total of 
21 945 (79%) resistance tests had an associated gender, 
ethnicity and probable exposure group; 11348 (52%) 
patients were MSM, 10 567 (48%) had a heterosexual 
exposure source and 30 had another exposure source; 
1 1 784 (54%) patients were white, 7374 (34%) were black 
African, 847 (4%) were black Caribbean, 458 (2%) were 
of another black ethnicity and 1482 (7%) had another 
ethnicity. A diagnosis date was available for 20 159 (92%) 
patients. Approximately, two-thirds of the resistance 
tests were conducted within 3 months of diagnosis 
(n= 13 385; 66%) and the median (interquartile range, 
IQR) time between diagnosis and resistance test date was 
25 (7—394) days. The number of tests conducted within 
3 months of diagnoses increased from 58% in 2002 to 75% 
in 2010, P< 0.001, reflecting national guidelines for 
resistance testing. 

Demographic information was missing for 5712 resistance 
tests, but by adjusting for national diagnoses, any bias in the 
differential collection of demographic information will 
have been minimized. Patients with missing information 
were less likely to have subtype B virus (n= 2708; 47.4% 
compared with n— 11354; 51.7%; P< 0.001) and more 
likely to have subtype C virus (n = 1628; 28.5% compared 
with h=5714; 26.0%; P< 0.001) than patients wherein 
this information was known. This suggests that probable 
exposure source information is more likely to be missing 
from transmission groups with a greater prevalence of 
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subtype C virus, such as the black African heterosexual 
population. 

To determine how representative our data was to the UK 
epidemic, we compared sites in our study where more 
than one third of resistance tests conducted had 
demographic data to those that did not. As a result of 
the large sample size, significant differences were found 
between these sites. There were significantly more male 
patients in the sites with more comprehensive demo- 
graphic data (61 vs. 59%; P< 0.001) and consequently 
there were also differences in the proportion of MSM 
(34 vs. 32%; P< 0.001) and black Africans (45 vs. 47%; 
P< 0.001). Overall, in absolute terms, these differences 
were felt to be small and unlikely to have a large impact on 
the prevalence of HIV-1 subtypes. 

Overall temporal trends 

As is typical of other resource-rich countries, the 
predominant viral subtype was found to be subtype B 
(n = 24040; 39.9%). Other common subtypes were C 
(k=20 678; 34.3%), A (k = 3006; 5.0%), novel recom- 
binants (k=5989; 9.9%), other CRFs («= 1860; 3.1%), 
D (n= 1521; 2.5%), G (n— 1636; 2.7%) and CRF01 AE 
(n— 1224; 2.0%). Table 1 presents the temporal trends for 
these main subtypes and Fig. 1 illustrates these. Major 
changes occurred between 2002 and 2005 as the 
estimated number of diagnoses per year increased from 
2223 to 2855 for subtype B and for subtype C increased 
from 2372 to 2933 followed by a decrease to 2629. After 
2006, the number of subtype B diagnoses remained 
relatively stable, whereas the number of subtype C 
diagnoses continued to decrease. 

The number of novel recombinant diagnoses increased 
from 461 to 742 per year between 2002 and 2010; the 
constituent subtypes found in these recombinants are 
shown in Table 3 (Supplementary Digital Content, http:// 
links.lww.com/QAD/A440) and these novel recombi- 
nants often contained fragments of subtypes G (n = 575), A 
(n = 566) and B (n = 434) . Due to the overall decline in the 
number of yearly HIV-1 diagnoses, this increase meant that 
novel recombinants grew as a proportion of diagnosed 
infections and by 2010 accounted for 13.3% of all new 
diagnoses. CRF01_AE diagnoses increased from 49 to 173 
per year between 2002 and 2006, and remained relatively 
stable after this. Other CRFs remained stable throughout 
the entire period studied and were mostly consisted of 
CRF02_AG {n = 250; 46% of other CRFs), CRF15_01B 
(n = 77; 14%) and CRF6_cpx («=69; 13%). Finally, the 
numbers of patients diagnosed with subtypes A and D 
declined disproportionately between 2002 and 2010, 
leading to a decrease in the overall proportion of infections 
with these subtypes. 

Subtype by demographic subgroup 

The subtype diversity within demographic subgroups is 
shown in Table 2. MSM were found to predominantly, 
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(a) (b) 




Other pure CRF01 AE — ♦ Other CRF Novel recombinant 



Fig. 1 . Trends in numbers of patients diagnosed by subtype between 2002 and 201 0. (a) Al I subtypes (b) non-B, non-C subtypes 
only (using different scale on y-axis). 



though not exclusively, have subtype B virus (n= 18 847; 
88.4%). Although MSM had less subtype diversity than 
other transmission groups, all subtypes were represented 
and there were a notable number of patients infected with 
novel recombinant strains (1200; 5.6%). Patients with 
heterosexual contact as a probable exposure source had a 
much greater variety of subtypes, although there were 
gender differences in this group. The most frequent 
subtype for both genders was subtype C, although a 
greater proportion of women (n= 12 789; 55.0%) have 
this subtype than men («=7331; 46.7%; P<0.001). 
Subtype B infections were more common in heterosexual 
men (n=2947; 18.8%) than women (2246; 9.7%; 
P<0.001). Subtype A was also frequent in those with 
heterosexual contact as well as a variety of CRFs. Finally, 
novel recombinants were frequent in both men and 



women with a probable heterosexual exposure source and 
occurred in greater numbers within heterosexuals than 
MSM reflecting historical differences in viral diversity 
between these populations. 

Temporal trends by demographic subgroup 

Temporal trends between 2002 and 2010 in subtype 
distribution by demographic subgroup are shown in 
Figs. 2 and 3 and Table 4 (Supplementary Digital 
Content, http://links.lww.com/QAD/A440) shows the 
subtype distribution among new diagnoses for hetero- 
sexual, black African or white, men and women. The 
total number of heterosexual black Africans diagnosed 
with HIV-1 fell from 2005/2006, which has similarly 
led to a decline in the total number of each subtype 
diagnosed. Trends were similar for black African men and 



Table 2. Subtype by probable exposure category, 2002-2010. 







Heterosexual contact 




Subtype 


Sex between men 


Men 


Women 


Total 


A 
B 
C 
D 
G 

Other pure 

CRF01_AE 

Other CRF 

Novel recombinants 

Total 


170; 0.8% (83) 
18 847; 88.4% (9975) 
558; 2.6% (305) 
38; 0.2% (17) 
70; 0.3% (37) 
71 ; 0.3% (44) 
151; 0.7% (95) 
215; 1.0% (122) 
1200; 5.6% (666) 
21 321; 35.4% (1 1 344) 


1013; 6.4% (235) 
2947; 1 8.8% (692) 
7331; 46.7% (1884) 
501; 3.2% (102) 
570; 3.6% (188) 

85; 0.5% (27) 
806; 5.1% (187) 
644; 4.1% (174) 
1816; 11.6% (534) 
15 714; 26.1% (4023) 


1822; 7.8% (478) 
2246; 9.7% (687) 
12 789; 55.0% (3525) 
983; 4.2% (195) 
995; 4.3% (325) 
157; 0.7% (51) 
267; 1.2% (109) 
1001; 4.3% (259) 
2973; 12.8% (949) 
23 234; 38.5% (6578) 


3006 (796) 
24 040 (11 354) 
20678 (5714) 
1521 (314) 
1636 (550) 
314 (122) 
1224 (391) 
1860 (555) 
5989 (2149) 
60 270 (21 945) 



Estimated total number of diagnoses; percentage of exposure category (total number of resistance tests). CRF, circulating recombinant form. 
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women with an increase in the proportion of patients 
diagnosed with non-B non-C subtypes and a decrease in 
the proportion of subtype C. Non-B non-C subtypes 
were diagnosed in 485 and 264 black African women and 
men, respectively, in 2010. The declining trend was most 



prominent for subtypes A, D and other CRFs but was 
partially offset by an increase in the absolute number and 
proportion infected with novel recombinant virus. For 
white heterosexual men and women, there are no clear 
time trends, although with limited data available for 
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Fig. 3. Trends in subtype over time among MSM, 2002-2009. Note different scales on y-axis by ethnic group. 
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2002—2004. White heterosexual women are roughly 
equally divided between subtype B, C and non-B non-C, 
whereas white heterosexual men have proportionately 
more subtype B infections (41.1% of overall infections) 
and proportionately fewer subtype C infections (22.5% of 
overall infections) compared with women (P<0.001). 

The number of HIV-1 infections diagnosed among MSM 
increased prior to 2007 and has remained high since then 
in all ethnic groups. This has been accompanied by an 
increase in the total number of non-B non-C subtypes 
diagnosed during this period. Substantial differences in 
the subtype make-up of diagnosed MSM by ethnicity are 
evident. White MSM and other MSM were chiefly 
diagnosed with subtype B, whereas black African MSM 
were estimated to have a mixture of subtype B (250; 
48.9%), C (85; 16.7%) and non-B, non-C (177; 34.6%). 
However, the number and proportion of non-B, non-C 
subtypes for white MSM increased markedly from 80 
(4.8%) in 2002 to 272 (12.4%) in 2010. This increase was 
largely driven by novel recombinant viruses, accounting 
for 8.0% of diagnoses among white MSM in 2010. Trends 
cannot be discerned for other ethnicities due to the small 
numbers diagnosed. 



Discussion 



This study found broad representation of HIV-1 subtypes 
within the UK, both overall and within specific 
demographic subgroups. Subtype should no longer be 
used as a proxy for transmission group when this is 
unknown, as has been the case historically, as heterosexual 
infections now account for 22% of all subtype B 
infections. The increase in non-B non-C subtypes, 
particularly novel recombinants among MSM of white 
and other ethnicity, demonstrates that there is ongoing 
emergence of new novel recombinant forms in the UK or 
that these variants are more transmissible. The number of 
patients diagnosed with subtype B virus has remained 
stable and is the most prevalent subtype in the UK, 
whereas the annual number of subtype C HIV-1 
diagnoses, has fallen since 2004, reflecting a decline in 
new HIV-1 diagnoses in the black African population. 
However, it remains the second most prevalent subtype 
overall and remains the most common subtype in the 
black African population. 

Previous UK studies [18] have also found 'an increasingly 
broad range of genetic diversity' and using phylogenetic 
surveillance showed that subtypes typically associated 
with infections acquired in East Africa were spreading 
among MSM in the UK. Gifford et at. [18] used 10 537 
sequences collected between 1996 and 2004, so our study 
represents a significant update on the changes in the 
prevalence of subtypes that have occurred since then, 
while also focusing on temporal trends within probable 



exposure groups in greater detail. Another UK study [10] 
examined the prevalence of non-B subtypes among MSM 
diagnosed from 1980 to 2007, and also found that 
nonsubtype B infections were increasing in the UK and 
suggested that the association of subtypes to socio- 
demographic groups was weakening. 

A recent meta-analysis [19] of global data, with 65 913 
samples, has concluded that there has been an increase in 
the proportion of the subtypes A, F, G, H, CRF01 AE and 
CRF02 AG and that the global proportion of all CRFs 
has also grown, but did not describe this by demographic 
subgroup. However, URFs decreased as a proportion of 
infections by 3.1% between 2000-2003 and 2004-2007. 
Abecasis et al. [20] used data from the SPREAD European 
surveillance programme in 2002—2005 and found that 
subtype was highly determined by demographic sub- 
groups and suggested highly compartmentalized, parallel, 
epidemics but only CRF02_AG and subtype F had 
significant time trends. Our more recent data suggest that 
the effect of increased migration to the UK has led to a 
very different epidemic to the rest of Europe, although 
the compartmentalization found in SPREAD may have 
also broken down since 2005. Other studies [21—23] have 
also observed increasing genetic diversity demonstrating 
that this is not a phenomenon unique to the UK. 

Our study population comprises patients who received a 
drug resistance test and may not therefore be representa- 
tive of the entire HIV-1 -positive population in the UK. 
We tried to minimize this bias by only using tests 
conducted after 2001 and by accounting for differential 
use of resistance testing by ethnicity and exposure group. 
However, we could not adjust for the 24% of the HIV-1- 
positive population estimated to not yet be aware of their 
infection [24]. As a consequence, we may be under- 
reporting some non-B subtypes as previous research has 
shown that non-African born heterosexuals have a higher 
undiagnosed rate of infection compared with MSM (31 
compared with 20% [24]). For our temporal trend 
analyses, we used the year of sampling for the resistance 
test, which might not reflect the date of seroconversion, 
although this should not change our main conclusions. It 
would have been interesting to have conducted further 
analyses comparing UK acquired infections to those 
acquired abroad. An indirect method for estimating 
country of infection, based on year of migration and 
CD4 + cell count at diagnosis, was recently published [25] 
and showed that 33% of heterosexuals born abroad 
acquired their infection in the UK. However, a similar 
analysis was not possible in this study due to incomplete 
data on these two variables. 

The choice of SCUEAL as the automated subtyping 
tool could have influenced our findings compared to 
other studies, which have generally used the REGA v2 
algorithm. A comparison between the subtype assigned 
by SCUEAL and REGA v2 to sequences in this study is 
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shown in Table 6 (Supplementary Digital Content, 
http://links.lww.com/QAD/A440). Consistent with 
another analysis [26], there was strong concordance 
between the pure subtypes; the main difference between 
the algorithms is that viruses classified as CRF02_AG by 
REGA are usually classified as either G, A/G recombi- 
nants with alternate breakpoints, or as a complex subtype, 
by SCUEAL. The estimated prevalence of these subtypes 
would, therefore, have been affected accordingly. Our 
analysis uses partial sequences of the pol gene, comprising 
approximately 10% of the viral genome, as this is the 
region typically sequenced for resistance testing. This 
results in underestimates of the prevalence of novel 
recombinant forms as mosaic viruses may have break- 
points outside of this region. This bias cannot currently be 
quantified, but the increasing availability of full length 
sequences from next generation sequencing platforms 
will allow future quantification of this effect. 

The increasing diversity of HIV-1 could be viewed as 
evidence that any fitness differences between subtypes are 
likely to be small. The fact that distinct demographic 
subgroups have overlapping subtypes suggests sexual 
linkage between different subgroups, although this can 
only be confirmed by conducting further phylogenetic 
analyses. The increase in novel recombinants is further 
evidence that sexual linkage between demographic 
subgroups is occurring for individuals to be simul- 
taneously infected with genetically distinct viruses. The 
impact this increase in novel recombinants will have on 
disease progression, treatment and the development of 
drug resistance is not yet known. 
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